智能论文笔记

GOF-TTE: Generative Online Federated Learning Framework for Travel Time Estimation

Zhiwen Zhang , Hongjun Wang , Jiyuan Chen , Zipei Fan , Xuan Song , Ryosuke Shibasaki

分类：机器学习 | 人工智能

2022-07-02

估计路径的旅行时间是智能运输系统的重要主题。它是现实世界应用的基础，例如交通监控，路线计划和出租车派遣。但是，为这样的数据驱动任务构建模型需要大量用户的旅行信息，这与其隐私直接相关，因此不太可能共享。数据所有者之间的非独立和相同分布的（非IID）轨迹数据也使一个预测模型变得极具挑战性，如果我们直接应用联合学习。最后，以前关于旅行时间估算的工作并未考虑道路的实时交通状态，我们认为这可以极大地影响预测。为了应对上述挑战，我们为移动用户组引入GOF-TTE，生成的在线联合学习框架以进行旅行时间估计，这是我）使用联合学习方法，允许在培训时将私人数据保存在客户端设备上，并设计设计和设计。所有客户共享的全球模型作为在线生成模型推断实时道路交通状态。 ii）除了在服务器上共享基本模型外，还针对每个客户调整了一个微调的个性化模型来研究其个人驾驶习惯，从而弥补了本地化全球模型预测的残余错误。％iii）将全球模型设计为所有客户共享的在线生成模型，以推断实时道路交通状态。我们还对我们的框架采用了简单的隐私攻击，并实施了差异隐私机制，以进一步保证隐私安全。最后，我们对Didi Chengdu和Xi'an的两个现实世界公共出租车数据集进行了实验。实验结果证明了我们提出的框架的有效性。

translated by 谷歌翻译

Route to Time and Time to Route: Travel Time Estimation from Sparse Trajectories

Zhiwen Zhang , Hongjun Wang , Zipei Fan , Jiyuan Chen , Xuan Song , Ryosuke Shibasaki

分类：人工智能

2022-06-21

由于物联网（IoT）技术的快速开发，许多在线Web应用程序（例如Google Map和Uber）估计移动设备收集的轨迹数据的旅行时间。但是，实际上，复杂的因素（例如网络通信和能量限制）使以低采样率收集的多个轨迹。在这种情况下，本文旨在解决稀疏场景中的旅行时间估计问题（TTE）和路线恢复问题，这通常会导致旅行时间的不确定标签以及连续采样的GPS点之间的路线。我们将此问题提出为不进行的监督问题，其中训练数据具有粗糙的标签，并共同解决了TTE和路线恢复的任务。我们认为，这两个任务在模型学习过程中彼此互补并保持这种关系：更精确的旅行时间可以使路由更好地推断，从而导致更准确的时间估计）。基于此假设，我们提出了一种EM算法，以替代E估计通过E步中通过弱监督的推断路线的行进时间，并根据M步骤中的估计行进时间来检索途径，以稀疏轨迹。我们对三个现实世界轨迹数据集进行了实验，并证明了该方法的有效性。

translated by 谷歌翻译

Event-Aware Multimodal Mobility Nowcasting

Zhaonan Wang , Renhe Jiang , Hao Xue , Flora D. Salim , Xuan Song , Ryosuke Shibasaki

分类：机器学习 | 人工智能

2021-12-14

作为一个决定性的部分，在移动式服务（MAA）的成功中，人群运动的时空预测建模是一个具有挑战性的任务，特别是考虑到社会事件驱动偏离正常性的移动性行为的情景。虽然已经进行了深入学习的高级时空态度，但大多数情况下都是巨大进展，如果不是所有现有方法都不知道多种传输模式之间的动态相互作用，也不是对潜在的社会事件带来的前所未有的波动性。在本文中，我们的动力是从两个视角改善规范时空网络（ST-Net）：（1）设计异质移动信息网络（Hmin），明确地在多模式移动性中明确代表差异; （2）提出内存增强的动态滤波器发生器（MDFG），以产生各种场景的动态方式生成序列特定参数。增强的事件感知的时空网络，即East-Net，在几个现实世界数据集中评估了各种各样的社会事件的繁多和覆盖范围。与最先进的基线相比，定量和定性实验结果验证了我们方法的优势。代码和数据在https://github.com/dunderdoc-wang/east-net上发布。

translated by 谷歌翻译

Differentiable Projection for Constrained Deep Learning

Dou Huang , Haoran Zhang , Xuan Song , Ryosuke Shibasaki

分类：机器学习

2021-11-21

深度神经网络（DNN）在解决各种领域的不同任务方面取得了非凡的性能。然而，传统的DNN模型通过损耗反向化稳定地接近地面真值。在某些应用中，可以容易地获得一些先验的知识，例如在遵循地面真理观察的约束。在这里，我们尝试提供一种普遍的方法来从这些约束中纳入信息以增强DNN的性能。从理论上讲，我们可以将这些类型的问题制定为KKT条件可以解决的受限优化问题。在本文中，我们建议在DNN中使用可分化的投影层，而不是直接求解耗时的KKT条件。所提出的投影方法可分辨，并且不需要重大计算。最后，我们还使用Pascal VOC DataSet使用随机生成的合成数据集和图像分割任务进行了一些实验，以评估所提出的投影方法的性能。实验结果表明，投影方法足够且优于基线方法。

translated by 谷歌翻译

Supervised Anomaly Detection Method Combining Generative Adversarial Networks and Three-Dimensional Data in Vehicle Inspections

Yohei Baba , Takuro Hoshi , Ryosuke Mori , Gaurang Gavai

分类：计算机视觉 | 机器学习

2022-12-22

The external visual inspections of rolling stock's underfloor equipment are currently being performed via human visual inspection. In this study, we attempt to partly automate visual inspection by investigating anomaly inspection algorithms that use image processing technology. As the railroad maintenance studies tend to have little anomaly data, unsupervised learning methods are usually preferred for anomaly detection; however, training cost and accuracy is still a challenge. Additionally, a researcher created anomalous images from normal images by adding noise, etc., but the anomalous targeted in this study is the rotation of piping cocks that was difficult to create using noise. Therefore, in this study, we propose a new method that uses style conversion via generative adversarial networks on three-dimensional computer graphics and imitates anomaly images to apply anomaly detection based on supervised learning. The geometry-consistent style conversion model was used to convert the image, and because of this the color and texture of the image were successfully made to imitate the real image while maintaining the anomalous shape. Using the generated anomaly images as supervised data, the anomaly detection model can be easily trained without complex adjustments and successfully detects anomalies.

translated by 谷歌翻译

GraphIX: Graph-based In silico XAI(explainable artificial intelligence) for drug repositioning from biopharmaceutical network

Atsuko Takagi , Mayumi Kamada , Eri Hamatani , Ryosuke Kojima , Yasushi Okuno

分类：机器学习

2022-12-21

Drug repositioning holds great promise because it can reduce the time and cost of new drug development. While drug repositioning can omit various R&D processes, confirming pharmacological effects on biomolecules is essential for application to new diseases. Biomedical explainability in a drug repositioning model can support appropriate insights in subsequent in-depth studies. However, the validity of the XAI methodology is still under debate, and the effectiveness of XAI in drug repositioning prediction applications remains unclear. In this study, we propose GraphIX, an explainable drug repositioning framework using biological networks, and quantitatively evaluate its explainability. GraphIX first learns the network weights and node features using a graph neural network from known drug indication and knowledge graph that consists of three types of nodes (but not given node type information): disease, drug, and protein. Analysis of the post-learning features showed that node types that were not known to the model beforehand are distinguished through the learning process based on the graph structure. From the learned weights and features, GraphIX then predicts the disease-drug association and calculates the contribution values of the nodes located in the neighborhood of the predicted disease and drug. We hypothesized that the neighboring protein node to which the model gave a high contribution is important in understanding the actual pharmacological effects. Quantitative evaluation of the validity of protein nodes' contribution using a real-world database showed that the high contribution proteins shown by GraphIX are reasonable as a mechanism of drug action. GraphIX is a framework for evidence-based drug discovery that can present to users new disease-drug associations and identify the protein important for understanding its pharmacological effects from a large and complex knowledge base.

translated by 谷歌翻译

P2Net: A Post-Processing Network for Refining Semantic Segmentation of LiDAR Point Cloud based on Consistency of Consecutive Frames

Yutaka Momma , Weimin Wang , Edgar Simo-Serra , Satoshi Iizuka , Ryosuke Nakamura , Hiroshi Ishikawa

分类：计算机视觉 | 机器人

2022-12-01

We present a lightweight post-processing method to refine the semantic segmentation results of point cloud sequences. Most existing methods usually segment frame by frame and encounter the inherent ambiguity of the problem: based on a measurement in a single frame, labels are sometimes difficult to predict even for humans. To remedy this problem, we propose to explicitly train a network to refine these results predicted by an existing segmentation method. The network, which we call the P2Net, learns the consistency constraints between coincident points from consecutive frames after registration. We evaluate the proposed post-processing method both qualitatively and quantitatively on the SemanticKITTI dataset that consists of real outdoor scenes. The effectiveness of the proposed method is validated by comparing the results predicted by two representative networks with and without the refinement by the post-processing network. Specifically, qualitative visualization validates the key idea that labels of the points that are difficult to predict can be corrected with P2Net. Quantitatively, overall mIoU is improved from 10.5% to 11.7% for PointNet [1] and from 10.8% to 15.9% for PointNet++ [2].

translated by 谷歌翻译

Surgical Skill Assessment via Video Semantic Aggregation

Zhenqiang Li , Lin Gu , Weimin Wang , Ryosuke Nakamura , Yoichi Sato

分类：计算机视觉

2022-08-04

基于视频的自动化手术技能评估是协助年轻的外科学员，尤其是在资源贫乏地区的一项有前途的任务。现有作品通常诉诸CNN-LSTM联合框架，该框架对LSTM的长期关系建模在空间汇总的短期CNN功能上。但是，这种做法将不可避免地忽略了空间维度中工具，组织和背景等语义概念之间的差异，从而阻碍了随后的时间关系建模。在本文中，我们提出了一个新型的技能评估框架，视频语义聚合（Visa），该框架发现了不同的语义部分，并将它们汇总在时空维度上。语义部分的明确发现提供了一种解释性的可视化，以帮助理解神经网络的决策。它还使我们能够进一步合并辅助信息，例如运动学数据，以改善表示和性能。与最新方法相比，两个数据集的实验显示了签证的竞争力。源代码可在以下网址获得：bit.ly/miccai2022visa。

translated by 谷歌翻译

Learning Deep Input-Output Stable Dynamics

Yuji Okamoto , Ryosuke Kojima

分类：机器学习 | 机器人

2022-06-27

从观察到的时间序列数据中学习稳定的动态是机器人技术，物理建模和系统生物学中的重要问题。这些动态中的许多被表示为与外部环境通信的输入输出系统。在这项研究中，我们专注于投入输出稳定系统，表现出对意外刺激和噪声的鲁棒性。我们提出了一种学习保证输入输出稳定性的非线性系统的方法。我们提出的方法利用了满足汉密尔顿 - 雅各比不平等的空间上的可区分投影来实现输入输出稳定性。找到该投影的问题可以作为二次约束二次编程问题，并分析得出特定的解决方案。此外，我们将方法应用于玩具双基生模型以及训练由葡萄糖胰岛素模拟器产生的基准测试的任务。结果表明，通过我们的方法，具有神经网络的非线性系统可以达到输入输出稳定性，这与天真的神经网络不同。我们的代码可在https://github.com/clinfo/deepiostability上找到。

translated by 谷歌翻译

Replacing Labeled Real-image Datasets with Auto-generated Contours

Hirokatsu Kataoka , Ryo Hayamizu , Ryosuke Yamada , Kodai Nakashima , Sora Takashima , Xinyu Zhang , Edgar Josafat Martinez-Noriega , Nakamasa Inoue , Rio Yokota

分类：计算机视觉 | 人工智能 | 机器学习

2022-06-18

在目前的工作中，我们表明，公式驱动的监督学习（FDSL）的表现可以匹配甚至超过Imagenet-21K的表现，而无需在视觉预训练期间使用真实的图像，人类和自我选择变压器（VIT）。例如，在ImagEnet-21K上预先训练的VIT-BASE在ImagEnet-1K上进行微调时，在ImagEnet-1K和FDSL上进行微调时显示了81.8％的TOP-1精度，当在相同条件下进行预训练时（图像数量，数量，，图像数量，超参数和时期数）。公式产生的图像避免了隐私/版权问题，标记成本和错误以及真实图像遭受的偏见，因此具有巨大的预训练通用模型的潜力。为了了解合成图像的性能，我们测试了两个假设，即（i）对象轮廓是FDSL数据集中重要的，（ii）创建标签的参数数量增加会影响FDSL预训练的性能改善。为了检验以前的假设，我们构建了一个由简单对象轮廓组合组成的数据集。我们发现该数据集可以匹配分形的性能。对于后一种假设，我们发现增加训练任务的难度通常会导致更好的微调准确性。

translated by 谷歌翻译